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Abstract 

Clustering under most popular objective functions is NP-hard, even to approximate well, and so unlikely to be efficiently 
solvable in the worst case. Recently, Bilu and Linial [TT[ suggested an approach aimed at bypassing this computational 
barrier by using properties of instances one might hope to hold in practice. In particular, they argue that instances 
in practice should be stable to small perturbations in the metric space and give an efficient algorithm for clustering 
instances of the Max-Cut problem that are stable to perturbations of size 0{n x / 2 ). In addition, they conjecture that 
instances stable to as little as O(l) perturbations should be solvable in polynomial time. In this paper we prove that 
. this conjecture is true for any center-based clustering objective (such as A;-median, fc-means, and A:-center). Specifically, 
we show we can efficiently find the optimal clustering assuming only stability to factor-3 perturbations of the underlying 
metric in spaces without Steiner points, and stability to factor 2 + y/3 perturbations for general metrics. In particular, 
we show for such instances that the popular Single-Linkage algorithm combined with dynamic programming will find 
the optimal clustering. We also present NP-hardness results under a weaker but related condition. 

Keywords: Clustering, fc-median, /c-means, Stability Conditions 



1. Introduction 

Problems of clustering data arise in a wide range of 
different areas - clustering proteins by function, cluster- 
ing documents by topic, and clustering images by who or 
what is in them, just to name a few. In this paper we focus 
on the popular class of center based clustering objectives, 
such as fc-median, fc-center and fc-means. Under these ob- 
jectives we not only partition the data into k subsets, but 
we also assign k special points, called the centers, one in 
each cluster. The quality of a solution is then measured 
as a function of the distances between the data points and 
their centers. For example, in the fc-median objective, the 
goal is to minimize the sum of distances of all points from 
their nearest center, and in the fc-means objective, we min- 
imize the sum of the same distances squared. As these are 
NP-hard problems (TTJ [TS1 [UJ , there has been substantial 
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work on approximation algorithms [H [3J |H1 HH HH1 [H| with 
both upper and lower bounds on approximability of these 
and other objective functions. Note that we are especially 
interested in the case that k is part of the input and not a 
constant. 

Recently, Bilu and Linial [TT], focusing on the Max- 
Cut problem [IB], proposed considering instances where 
the optimal clustering is optimal not only under the given 
metric, but also under any bounded multiplicative pertur- 
bation of the given metric. This is motivated by the fact 
that in practice, distances between data points are typi- 
cally just the result of some heuristic measure (e.g., edit- 
distance between strings or Euclidean distance in some 
feature space) rather than true "semantic distance" be- 
tween objects. Thus, unless the optimal solution on the 
given distances is correct by pure luck, it likely is correct 
on small perturbations of the given distances as well. Bilu 
and Linial analyze Max-Cut instances of this type and 
show that for instances that are stable to perturbations 
of multiplicative factor roug hly 0{n^ 2 ), one can retrieve 
the optimal Max-Cut in polynomial time. However, they 
conjecture that stability up to only constant magnitude 
perturbations should be enough to solve the problem in 
polynomial time. In this paper we show that this conjec- 
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ture is indeed true for fc-median and fc-means objectives 
and in fact for any well-behaved center-based objective 



function (see Definition 1.3 1. 



1.1. Main Result 

First, let us formally define the notion due to [IT] of 
stability under multiplicative perturbations, stated in this 
context. 

Definition 1.1. Given a metric (S,d), and a > 1, we say 

a function d' : S x S — > M>o is an a-perturbation of d, if 
for any x,y € S it holds that 

d(x,y) < d'(x,y) < ad(x,y) 

Note that d! may be any non-negative function, and 
need not be a metric. 

Definition 1.2. Suppose we have a clustering instance 
composed of n points residing in a metric (S, d) and an 
objective function $ we wish to optimize. We call the clus- 
tering instance a-perturbation resilient for $ if for any d! 
which is an a-perturbation of d, the (only) optimal cluster- 
ing of (S,d r ) under 3> is identical, as a partition of points 
into subsets, to the optimal clustering of (S, d) under 

We will in particular be concerned with separable, center- 
based clustering objectives (which include fc-median, fc- 
means, and fc-center among others). 

Definition 1.3. A clustering objective is center-based if 
the optimal solution can be defined by k points c\, . . . ,c* k in 
the metric space called centers such that every data point is 
assigned to its nearest center. Such a clustering objective 
is separable if it furthermore satisfies the following two 
conditions: 

• The objective function value of a given clustering is 
either a (weighted) sum or the maximum of the in- 
dividual cluster scores. 

• Given a proposed single cluster, its score can be com- 
puted in polynomial time. 

Our main result is that we can efficiently find the opti- 
mal clustering for perturbation-resilient instances of sepa- 
rable center-based clustering objectives. In particular, we 
get an efficient algorithm for 3-perturbation-resilient in- 
stances when the metric S is defined only over data points, 
and for (2 + \/3)-perturbation-resiliant instances for gen- 
eral metrics. 

Theorem 1.4. For a > 3 (in the case of finite metrics 
defined only over the data) or a > 2 + \/3 (for general 
metrics), there is a polynomial-time algorithm that finds 
the optimal clustering of a-perturbation resilient instances 
for any given separable center-based clustering objective. 



The algorithm, described in Section |2.2[ turns out to 
be quite simple. As a first step, it runs the classic single- 
linkage algorithm, but unlike the standard approach of 
halting when fc clusters remain, it runs the algorithm un- 
til all points have been merged into a single cluster and 
keeps track of the entire tree-on-clusters produced^ Then, 
the algorithm's second step is to apply dynamic program- 
ming to this hierarchical clustering to identify the best 
fc-clustering that is present within the tree. Using a result 
of Balcan et al. [6] we show that the resulting clustering 
obtained is indeed the optimal one. Albeit being very dif- 
ferent, our approach resembles, in spirit, the work of Bar- 
tal [7], Abraham et al [JJ and Racke [22] in the sense that 
we reduce the problem of retrieving an optimal solution 
from a general instance to a tree-like instance (where it is 
poly-time solvable). 

Our algorithms use only a weaker property, which we 



call center-proximity (see Section 2.1 1, that is implied by 



perturbation-resilience. We then complement these results 
with a lower bound showing that for the problem of fc- 
median on general metrics, for any e > 0, there exist NP- 
hard instances that satisfy (3 — e)-center proximityj^] 

1.2. Related work 

There have been a number of investigations of differ- 
ent notions of stability for the problem of clustering. For 
example, Ostrovsky et al. [2TJ consider a fc-means instance 
to be stable if the optimal fc-clustering is substantially 
cheaper than the optimal (fc — l)-clustcring under this ob- 
jective. They present an efficient algorithm for finding 
near-optimal fc-means clusterings when this gap is large, 
and these results were subsequently strengthened to apply 
to smaller gaps in Balcan et al. [5] consider instead 
a clustering instance to be stable if good approximations 
to the given objective are guaranteed to be close, as clus- 
terings, to a desired ground-truth partitioning. This is 
motivated by the fact that when the true goal is to match 
some unknown correct answer (e.g., to correctly cluster 
proteins by their function), this is an implicit assumption 
already being made when viewing approximation ratio as 
a good performance measure. Balcan et al. [5] show that 
in fact this condition can be used to bypass approximation 
hardness results for a number of clustering objectives in- 
cluding fc-median and fc-means. Here they show that if all 
(1 + ^-approximations to the objective are <5-close to the 
desired clustering in terms of how points are partitioned, 
then one can efficiently get 0(<5/a)-close to the desired 
clustering. Ben-David et al. [TU1 [H] consider a notion of 



1 The example depicted in Figure [3] proves that indeed, halting 
the Single-Linkage algorithm once k clusters are formed may fail on 
certain ct-perturbation resilient instances. 

2 We note that while our belief was that allowing Steiner points 
in the lower bound was primarily a technicality, Balcan et al. (M.F. 
Balcan, personal communication) have recently shown this is not the 
case, giving a clever algorithm that finds the optimal clustering for 
fc-median instances in finite metrics when a = 1 + \[2. 
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stability of a clustering algorithm, which is called stable 
if it outputs similar clusters for different sets of n input 
points drawn from the same distribution. For fc-means, 
the work of Meila [30] discusses the opposite direction - 
classifying instances where an approximated solution for 
fc-means is close to the target clustering. 

2. Proof of Main Theorem 

2. 1 . Properties of Perturbation Resilient Instances 

We begin by deriving other properties which every a- 
perturbation resilient clustering instance must satisfy. 

Definition 2.1. Let p £ S be an arbitrary point, let c* be 
the center p is assigned to in the optimal clustering, and let 
c* 7^ c* be any other center in the optimal clustering. We 
say a clustering instance satisfies the a-center proximity 
property if for any p it holds that 

d(p,c*) > ad(p,c*) 

Fact 2.2. If a clustering instance satisfies the a -perturbation 
resilience property, then it also satisfies the a-center prox- 
imity property. 

Proof. Let C* and C* be any two clusters in the opti- 
mal clustering and pick any p £ C* . Assume we blow up 
all the pairwise distances within cluster C* by a factor 
of a. As this is a legitimate perturbation of the metric, 
it still holds that the optimal clustering under this per- 
turbation is the same as the original optimum. Hence, p 
is still assigned to the same cluster. Furthermore, since 
the distances within C* were all changed by the same 
constant factor, c* will still remain an optimal center of 
cluster i. The same holds for cluster C* . It follows that 
even in this perturbed metric, p prefers c* to c*. Hence 
ad(p, c* ) = d' (p, c* ) < d' (p, c* ) = d{p, c* ) . □ 

Corollary 2.3. For every point p and its center c* , and 
for every point p' from a different cluster, it follows that 
d{p,p')>{a-l)d{p,c*). 

Proof. Denote by c* the center of the cluster that p' be- 
longs to. Now, consider two cases. Case (a): d(p',c*) > 
d(jp,c*). In this case, by traingle inequality we get that 
d{p,p') > d(p',c*) — d{p,c*). Since the data instance is 



stable to a-perturbations, Fact 2.2 gives us that d(p' , c*) > 
ad(p', c*). Hence we get that d(p,p') > ad{p' ' , c*) — d(p, c*) 
> (a - \)d{p,c*). Case (b): d(p',c*) < d(p,c*). Again 
by traingle inequality we get that d(p,p') > d(p,c*) — 
d(p', c*) > adip, c*) - dip', c*) > (a - l)dip, c*). □ 

A key ingredient in the proof of Thcorcm |1.4| is the tree- 
clustering formulation of Balcan et. al [6 . In particular, 
we prove that if an instance satisfies a-center proximity for 
a > 3 (in the case of finite metrics without Steiner points) 
or for a > 2 + v3 (for general metrics) then it also sat- 
isfies the "min-stability property" (defined below). The 



min-stability property, as shown in [B], is sufficient (and 
necessary) for the Single-Linkage algorithm to produce a 
tree such that the optimal clustering is some pruning of 
this tree. In order to define the "min-stability" property, 
we first introduce the following notation. For any two sub- 
sets A, B C S, we denote the minimum distance between 
A and B as d min (A, B) = min{d(a, b) \ a £ A, b £ B}. 

Definition 2.4. A clustering instance satisfies the min- 
stability property if for any two clusters C and C in the 
optimal clustering, and any subset A C C , it holds that 
d min iA,C\A)<d min iA,C). 

In words, the min-stability property means that for 
any set A that is a strict subset of some cluster C in the 
optimal clustering, the closest point to A is a point from 
C \ A, and not from some other cluster. The next two 
lemmas lie at the heart of our algorithm. 

Lemma 2.5. A clustering instance in which centers must 
be data points that satisfies a-center proximity for a > 3 
(for a center-based clustering objective), also satisfies the 
min-stability property. 

Proof. Let C* , C* be any two clusters in the target clus- 
tering. Let A and A' be any two subsets s.t. A C C* and 
A' C C* . Let p e A and p' € A' be the two points which 
obtain the minimum distance d m - ln iA, A'). Let q G C* \ A 
be the nearest point to p. Also, denote by c* and c* the 
centers of clusters C* and C* respectively. 

For the sake of contradiction, assume that d m j n (^4, C* \ 
A) > d min iA,A'). Suppose c* £ A. This means that 
dip,!/) = d min iA,A') < d min iA,C*\A) < dip,c*). As 
a > 3, this contradicts Corollary |2.3[ 

Thus we may assume c* € A. It follows that d(g, c*) > 
dip,p') > (3-l)d(p,cf) = 2d(p,cf), so dip, cf) < d(q,c*)/2. 
We therefore have that d(p',c*) < dip,p') + dip, c*) < 
3(2(g,c*)/2. This implies that d{p',cf) < dip',c*)/a < 
diq,c*)/2, and thus d(q, c*) < %, c*)+d{c*,p)+djp,p') + 
dip', c*) < 3diq, c*) < ad{q, c*). This contradicts Fact 



2.2 



Lemma 2.6. A clustering instance in which centers need 
not be data points that satisfies a-center proximity for 
a > 2 + (for a center-based clustering objective), also 
satisfies the min-stability property. 



Proof. As in the proof of Lemma 2.5 let C*,C* be any 
two clusters in the target clustering and let A and A' be 
any two subsets s.t. A C C* and A' C C* . Let p e A 
and p' £ A' be the two points which obtain the minimum 
distance d m i n (A, A') and let q £ C* \ A be the nearest 
point to p. Also, as in the proof of Lemma 12.51 let c* 



c* denote the centers of clusters C* 



and 



and C* respectively 



(though these need not be datapoints). 

By definition of center-proximity, we have the following 
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inequalities: 

d(p,p') + d(p , c*) > ad(p, c*) [c. p. applied to p] 
d(p,p') + d(p,c*) > ad(p',c*) [c. p. applied to p'] 
d(p,p') +d(p',c*j) +d(p,q) > a(d(q,p) - d(p,c*)) 
[center proximity applied to q and triangle ineq.] 

Multiplying the first inequality by 1 — — , the sec- 
ond by , the third by -^—^ , and summing them together 
we get 

d(p,p') > ^^d(p,c*)+d(q, P ), 

which for a = 2 + \/3 implies d(p,p') > d(q,p) as desired. 

□ 

2.2. The Algorithm 

As mentioned, Balcan et al [5J proved (Theorem 2) 
that if an instance satisfies min-stability, then the tree on 
clusters produced by the single-linkage algorithm contains 
the optimal clustering as some fc-pruning of it. I.e., the 
tree produced by starting with n clusters of size 1 (viewed 
as leaves), and at each step merging the two clusters C,C" 
minimizing c? m ; n (C, C) (viewing the merged cluster as their 
parent) until only one cluster remains. Given the struc- 
tural results proven above, our algorithm (see Figure [T]) 
simply uses this clustering tree and finds the best fc-pruning 
using dynamic programming. 

Proof of Theorem \l-4\ By Lemmas |2.5| and |2.6[ the data 
satisfies the min-stability property, which as shown in [B] 
is sufficient to guarantee that some pruning of the single- 
linkage hierarchy is the target clustering. We then find the 
optimal clustering using dynamic programming by exam- 
ining fc-partitions laminar with the single-linkage cluster- 
ing tree. The optimal fc-clustering of a tree-node is either 
the entire subtree as one cluster (if fc = 1), or the mini- 
mum over all choices of fci-clusters over its left subtree and 
fc2-clusters over its right subtree (if fc > 1). Here fci, hi are 
positive integers, such that ki + k 2 = fc. Therefore, we just 
traverse the tree bottom-up, recursively solving the clus- 
tering problem for each tree-node. By assumption that the 
clustering objective is separable, so each step including the 
base-case can be performed in polynomial time. For the 
case of fc-median in a finite metric, for example, one can 
maintain a n x O(n) table for all possible centers and all 
possible clusters in the tree, yielding a running time of 
0(n 2 + nk 2 ). For the case of fc-means in Euclidean space, 
one can compute the cost of a single cluster by computing 
the center as just the average of all its points. In general, 
the overall running time is 0(n(k 2 + T(n))), where T(n) 
denotes the time it takes to compute the cost of a single 
cluster. □ 

2.3. Some Natural Barriers 

We complete this section with a discussion of barriers 
of our approach. First, our algorithm indeed fails on some 



finite metrics that are (3— e)-perturbation resilient. For ex- 
ample, consider the instance shown in Figure|2] In this in- 
stance, the clustering tree produced by single- linkage is not 
laminar with the optimal fc-median clustering. It is easy 
to check that this instance is resilient to a-perturbations 
for any a < 3. 

2 1 a-1 1 
• • • ^ •. 

q c p p c 

Figure 2: A finite metric fc-median instance with 2 < a < 3 
where our algorithm fails. The optimal 2-median clustering is 
{c,p, q}, {c',p'}. In contrast, when we run our algorithm over 
on this instance, single linkage first connects {c, p} with {c', p'}, 
and only then merges these 4 points with q. 

Second, observe that our analysis, though emanating 
from perturbation resilience, only uses center proximity. 
We next show that for general metrics, one cannot hope 
to solve (in poly-time) fc-median instances satisfying oo- 
center proximity for a < 3. This is close to our upper 
bound of 2 + \/S for general metrics. 

Theorem 2.7. For any a < 3, the problem of solving k- 
median instances over general metrics that satisfy a-center 
proximity is TSIP-hard. 

Proof. The proof of Theorem 1 2 . 7| follows from the classical 
reduction of Max-fc-Coverage to fc-median. In this reduc- 
tion, we create a bipartite graph where the right-hand side 
vertices represent the elements in the ground set; the left- 
hand side vertices represent the given subsets; and the 
distance between the set-vertex and each element-vertex 
is 1, if the set contains that element. Using shortest-path 
distances, it follows that the distance from any element- 
vertex to a set-vertex to which it does not belong to is at 
least 3. Using the fact that the NP-hardness results for 
Max-fc-Coverage holds for disjoint sets (i.e. the optimal 
solution of Yes-instances is composed of fc disjoint sets, 
see [15] ). the a-center proximity property follows. □ 

Lastly, we comment that using Single- Linkage in the 
usual way (namely, stopping when there are fc clusters re- 
maining) is not sufficient to produce a good clustering. We 
demonstrate this using the example shown in Figure [3j 
Observe, in this instance, since C contains significantly 
less points than A,B, or D, this instance is stable - even 
if we perturb distances by a factor of 3, the cost of any al- 
ternative clustering is higher than the cost of the optimal 
solution. However, because d(A, C) > d(B,D), it follows 
that the usual version of Single-Linkage will unite B and 
D, and only then A and C. Hence, if we stop the Single- 
Linkage algorithm at fc = 3 clusters, we will not get the 
desired clustering. 
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1. Run Single-Linkage until only one cluster remains, producing the entire tree on clusters. 

2. Find the best fc-pruning of the tree by dynamic programming using the equality 

best-fc-pruning(T) = min {best-fc'-pruning(T's left child) + bcst-(fc — fc')-pruning(T's right child)} 



Figure 1: Algorithm to find the optimal fc-clustering of instances satisfying a-center proximity. The algorithm is described for the 
case (as in fc-median or fc-means) that $ defines the overall score to be a sum over individual cluster scores. If it is a maximum 
(as in fc-center) then replace "+" with "max" above. 




Figure 3: An example showing failure of the usual version of 
Single-Linkage. The instance is composed of 4 components, 
each with inner-distance e and outer-distance as described in 
the figure. However, components A, B and D each contain 100 
points, whereas component C has only 10 points. The optimal 
3-median clustering consists of 3 clusters: {A, C}, {B}, {D} 
and has cost OPT = 200 + 300e. 

3. Open Problems 

There are several natural open questions left by this 
work. First, can one reduce the perturbation factor a 
needed for efficient clustering? As mentioned earlier, re- 
cently Balcan ct al. (M.F. Balcan, personal communica- 
tion) have given a very interesting algorithm that reduces 
the a = 3 factor needed by our algorithm for finite met- 
rics to 1 + \/2. Can one go farther, perhaps by using fur- 
ther implications of perturbation-resilience beyond center- 
proximity? Alternatively, if one cannot find the optimal 
clustering for small values of a, can one still find a near- 
optimal clustering, of approximation ratio better than what 
is possible on worst-case instances? 

In a different direction, one can also consider relax- 
ations of the perturbation-resilience condition. For exam- 
ple, Balcan et al. (personal communication) also consider 
instances that are "mostly resilient" to a-perturbations: 
under any a-perturbation of the underlying metric, no 
more than a <5-fraction of the points get mislabeled un- 
der the optimal solution. For sufficiently large constant a 
and sufficiently small constant 8, they present algorithms 
that get good approximations to the objective under this 
condition. A different kind of relaxation would be to con- 
sider a notion of resilience to perturbations on average: a 
clustering instance whose optimal clustering is likely not 
to change, assuming the perturbation is random from some 



suitable distribution. Can this weaker notion be used to 
still achieve positive guarantees? 

References 

[1] Ittai Abraham, Yair Bartal, T-H. Hubert Chan, Kcdar Dhamd- 
here Dhamdhere, Anupam Gupta, Jon Klcinbcrg, Ofcr Nciman, 
and Aleksandrs Slivkins. Metric embeddings with relaxed guar- 
antees. In Proc. 46th Annual IEEE Symp. Foundations of Com- 
puter Science (FOCS), 2005. 

[2] Sanjccv Arora, Prabhakar Raghavan, and Satish Rao. Ap- 
proximation schemes for euclidean k-medians and related prob- 
lems. In Proc. 30th Annual ACM Symp. Theory of Computing 
(STOC), 1998. 

[3] Vijay Arya, Naveen Garg, Rohit Khandekar, Adam Meyerson, 
Kamesh Munagala, and Vinayaka Pandit. Local search heuristic 
for k-median and facility location problems. In Proc. 33rd ACM 
Symp. Theory of Computing (STOC), 2001. 

[4] Pranjal Awasthi, Avrim Blum, and Or Shcffct. Stability yields a 
ptas for fc-median and fc-means clustering. In Proc. 51st Annual 
IEEE Symp. Foundations of Computer Science (FOCS), 2010. 

[5] Maria-Fiorina Balcan, Avrim Blum, and Anupam Gupta. Ap- 
proximate clustering without the approximation. In Proc. 19th 
Annual ACM-SIAM Symp. Discrete Algorithms (SODA), 2009. 

[6] Maria-Fiorina Balcan, Avrim Blum, and Santosh Vcmpala. 
A discriminative framework for clustering via similarity func- 
tions. In Proc. 40th Annual ACM Symp. Theory of Computing 
(STOC), 2008. 

[7] Yair Bartal. On approximating arbitrary metrices by tree met- 
rics. In Proc. 30th Annual ACM Symp. Theory of Computing 
(STOC), 1998. 

[8] Yair Bartal, Moses Charikar, and Danny Raz. Approximating 
min-sum k-clustering in metric spaces. In Proc. 33rd Annual 
ACM Symp. Theory of Computing (STOC), 2001. 
[9] Shai Ben-David, David Pal, and Hans-Ulrich Simon. Stability 
of fc-means clustering. In COLT, pages 20-34, 2007. 

[10] Shai Ben-David, Ulrike von Luxburg, and David Pal. A sober 
look at clustering stability. In Gabor Lugosi and Hans-Ulrich 
Simon, editors, COLT, volume 4005 of Lecture Notes in Com- 
puter Science, pages 5-19. Springer, 2006. 

[11] Yonatan Bilu and Nati Linial. Are stable instances easy? 1st 
Symp. Innovations in Computer Science (ICS), 2010. 

[12] Moses Charikar, Sudipto Guha, Eva Tardos, and David B. 
Shmoys. A constant-factor approximation algorithm for the 
k-median problem. In Proc. 31st Annual ACM Symp. Theory 
of Computing (STOC), 1999. 

[13] Sanjoy Dasgupta. The hardness of k-means clustering. Techni- 
cal report, University of California at San Diego, 2008. 

[14] W. Fernandez de la Vega, Marek Karpinski, Claire Kenyon, 
and Yuval Rabani. Approximation schemes for clustering prob- 
lems. In Proc. 35th Annual ACM Symp. Theory of Computing 
(STOC), 2003. 

[15] Uriel Feige. A threshold of Inn for approximating set cover. 
JACM, 45:314-318, 1998. 

[16] Michael R. Garey and David S. Johnson. Computers and In- 
tractability; A Guide to the Theory of NP- Completeness. W. 
H. Freeman & Co., New York, NY, USA, 1990. 



5 



[17] Sudipto Guha and Samir Khuller. Greedy strikes back: Im- 
proved facility location algorithms. In Journal of Algorithms, 
pages 649-657, 1998. 

[18] Kamal Jain, Mohammad Mahdian, and Amin Saberi. A new 
greedy approach for facility location problems (extended ab- 
stract). In Proc. 34th Annual ACM Symp. Theory of Comput- 
ing (STOC), pages 731-740, 2002. 

[19] Amit Kumar, Yogish Sabharwal, and Sandccp Sen. A simple lin- 
ear time (1+ e) -approximation algorithm for k-means clustering 
in any dimensions. In Proc. 45th Annual IEEE Symp. Founda- 
tions of Computer Science (FOCS), 2004. 

[20] Marina Meila. The uniqueness of a good optimum for k-means. 
In Proc. 23rd International Conference on Machine Learning 
(ICML), pages 625-632, 2006. 

[21] Rafail Ostrovsky, Yuval Rabani, Leonard J. Schulman, and 
Chaitanya Swamy. The effectiveness of Lloyd-type methods for 
the fc-mcans problem. In Proc. 47th Annual IEEE Symp. Foun- 
dations of Computer Science (FOCS), pages 165-176, 2006. 

[22] Harald Rackc. Optimal hierarchical decompositions for con- 
gestion minimization in networks. In Proc. 40th Annual ACM 
Symp. Theory of Computing (STOC), 2008. 



